Learning (k,l)-context-sensitive probabilistic grammars with nonparametric Bayesian approach

نویسندگان

چکیده

Inferring formal grammars with nonparametric Bayesian approach is one of the most powerful for achieving high accuracy from unsupervised data. In this paper, mildly-context-sensitive probabilities, called (k, l)-context-sensitive are defined on context-free (CFGs). CFGs where probabilities rules identified contexts can be seen as a kind dual approaches distributional learning, in which characterize substrings. We handle data sparsity context-sensitive by smoothing effect hierarchical models such Pitman–Yor processes (PYPs). define hierarchy PYPs naturally augmenting infinite PCFGs. The blocked Gibbs sampling known to effective inferring show that, modifying inside able applied probabilistic grammars. At same time, we that time complexity CFG $$O(|V|^{l+3}|w|^3)$$ each sentence w, V set nonterminals. Since it computationally too expensive iterate sufficient times especially when |V| not small, some alternative algorithms required. Therefore, propose new method composite sampling, procedure separated into sub-procedures nonterminals and derivation trees. Finally, demonstrate inferred 0)-context-sensitive achieve lower perplexities than other language PCFGs, n-grams, HMMs.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Application of the Variational Bayesian Approach to Probabilistic Context-Free Grammars

We present an efficient learning algorithm for probabilistic context-free grammars based on the variational Bayesian approach. Although the maximum likelihood method has traditionally been used for learning probabilistic language models, Bayesian learning is, in principle, less likely to cause overfitting problems than the maximum likelihood method. We show that the computational complexity of ...

متن کامل

Generalization in Context Sensitive Grammars

An alternative to the recognition systems used so far can be systems based on context sensitive grammars. A precondition for success will be an implementation with great capacity for generalization. We propose three methods of generalization based on the assumption that similar stimuli invoke similar or identical reactions.

متن کامل

Weakly Growing Context-Sensitive Grammars

This paper introduces weakly growing context-sensitive grammars. Abstract-1 Such grammars generalize the class of growing context-sensitive grammars (studied by several authors), in that these grammars have rules that “grow” according to a position valuation. If a position valuation coincides with the initial part of an expoAbstract-2 nential function, it is called a steady position valuation. ...

متن کامل

Attributed Context-Sensitive Graph Grammars

The paper introduces a concept of attributed context-sensitive graph grammars. The graph grammars are a graphical generalization of the textual grammars and can thus be used to specify the syntax of graphical programming or modeling languages. The attributed graph grammars extend the basic graph grammars with definitions of attributes and the associated attribute evaluation rules. By analogy to...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Machine Learning

سال: 2021

ISSN: ['0885-6125', '1573-0565']

DOI: https://doi.org/10.1007/s10994-021-06034-2